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A nucleic acid which is upregulated in human tumor cells, a protein encoded thereby and 

a process for tumor diagnosis 

» Breast cancer is a major health problem since every eighth women in Europe and the US 

J succumbs to this disease (Moustafa, A.S., and Nicoison, G.L., Oncol. Res. 9 (1997) 505-525; 

5 

I Nicoison, G.L., Biochem. Soc. Symp. 63 (1998) 231-242). Treatment includes surgery, 

radiation, chemotherapy and combinations thereof, depending on the stage of the disease 

5 (Schwirzke, M„ et al., Anticancer Res. 19 (1999) 1801-1814). A pronounced tropism of 

* metastasis to the bones is characteristic for this disease starting with micrometastic lesions 

M f in the bone marrow which finally may outgrow to full-blown metastases. Bone metastasis 

! j result in bone fractures and spinal cord depression syndrome often followed by severe pain, 

( aberrant calcium homeostasis and finally lead to death of the' patients (Coleman, R.E., and 

| 10 Rubens, R.D., Br. J. Cancer 55 (1987) 61-66). 

I 

!§* Analysis of genes involved in breast cancer and in cancer in general revealed a dichotomy 

I with one category of genes with deregulated expression due to mutation and the other 

category of genes exhibiting changes in their regulation. These findings resulted in the 
grouping of cancer genes into two classes: class I genes are mutated or deleted, class II genes 
15 exhibit no alterations at the DNA level (Sager, R., Science 246 (1989) 1406-1412; Sager, R„ 
Proc. Natl. Acad. Sci. USA 94 (1997) 952-955). 

Summary of the invention 

In accordance with the present invention, a protein and the related gene, termed PKW, is 
provided which is upregulated in tumor cells, preferably in mammary tumor cells, as 
20 compared to their non-tumor counterparts. The PKW gene codes preferably for a 
^ polypeptide consisting of SEQ ID NO:2 or SEQ ID NO:4. 

The present invention provides a nucleic acid which is upregulated in tumor cells, 
4 especially in mammary carcinoma cells, and which codes for a polypeptide which induces 

tumor progression or metastasis, the nucleic acid being selected from the group consisting 
25 of. 

(a) SEQ ID NO: 1; 

^ (b) a nucleic acid sequence which hybridizes under stringent conditions with a nucleic 

acid probe of the complementary sequence of (a); 



Printed:07-02-2001 



1 



-4- 



The PKW polypeptide can occur in natural allelic variations which differ from individual to 
individual. Such variations of the amino acids are usually amino acid substitutions. 
However, they may also be deletions, insertions or additions of amino acids to the total 
sequence. The PKW polypeptide according to the invention - depending, both in respect of 
5 the extent and type, on the cell and cell type in which it is expressed- can be in glycosylated 
or non-glycosylated form. Polypeptides according to the invention can be identified by 
transfection of PKW-negative non-tumor cells with expression vectors for PKW, 
establishment of stable transfectants and evaluation of their tumor progression capacity 
after xenografting into nude mice. 

10 "Polypeptide with PKW activity or PKW 1 ' means also a protein with minor amino acid 
variations but with substantially the same PKW activity. ^Substantially the same" means 
that the activities are of the same biological properties and the polypeptides show at least 
90% homology (identity) in amino acid sequence. 

The term "nucleic acid molecule or nucleic acid" denotes a polynucleotide molecule which 
15 can be, for example, a DNA, RNA, or derivatized active DNA or RNA. DNA and/or RNA 
molecules are preferred, however. 

The term "hybridize under stringent conditions" means that two nucleic acid fragments are 
capable of hybridization to one another under standard hybridization conditions described 
in Sambrook et al., Molecular Cloning: A Laboratory Manual (1989) Cold Spring Harbor 

20 Laboratory Press, New York, USA. More specifically, "stringent conditions" as used herein 
refer to hybridization in 6.0 x SSC at about 45°C, followed by a wash. This wash can be 
with 2.0 x SSC at 50°C. Preferably, hybridization is performed using the commercially 
available Express Hyb™ Hybridization Solution of Clontech, which is a non-viscious 
solution containing no salmon sperm DNA. The stringency of the salt concentration in the 

25 wash step can be selected, for example, from about 2.0 x SSC at 50°C, for low stringency, to 
about 0.2 x SSC at 50°C, for high stringency. In addition, the temperature in the wash step 
can be increased from low stringency conditions at room temperatures, about 22°C, to high 
stringency conditions at about 65°C. 

The phrase "nucleic acid or polypeptide" as used throughout this application refers to a 
30 nucleic acid or polypeptide having a PKW activity which is substantially free of cellular 
material or culture medium when produced by recombinant DNA techniques, or 
substantially free of chemical precursors or other chemicals when synthesized chemically. 
Such a nucleic acid is preferably free of sequences which naturally flank the nucleic acid 
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I . (Le. sequences located at the 5 1 and the 3 f ends of the nucleic acid) in the organism from 

which the nucleic acid is derived. 

The polypeptides according to the invention can be produced by recombinant means, or 
synthetically. Non-glycosylated PKW polypeptide is obtained when it is produced 
& 5 recombinantly in prokaryotes. With the aid of the nucleic acid sequences provided by the 

|| invention it is possible to search for the PKW gene or its variants in genomes of any desired 

cells (e.g. apart from human cells, also in cells of other mammals), to identify these and to 
isolate the desired gene coding for the PKW proteins. Such processes and suitable 
hybridization conditions are known to a person skilled in the art and are described, for 
\ 10 example, by Sambrook et aL, Molecular Gloning: A Laboratory Manual (1989) Cold Spring 

Harbor Laboratory Press, New York, USA, and Hames, B.D., Higgins, S.G., Nucleic Acid 
Hybridisation - A Practical Approach (1985) IRL Press, Oxford, England. In this case the 
standard protocols described in these publications are usually used for the experiments. 

With the aid of such nucleic acids coding for a PKW polypeptide, the polypeptide 
15 according to the invention can be obtained in a reproducible manner and in large amounts. 
For expression in prokaryotic or eukaryotic organisms, such as prokaryotic host cells or 
eukaryotic host cells, the nucleic acid is integrated into suitable expression vectors, 
according to methods familiar to a person skilled in the art. Such an expression vector 
preferably contains a regulatable/inducible promoter. These recombinant vectors are then 
20 introduced for the expression into suitable host cells such as, e.g., E. coli as a prokaryotic 
host cell or Saccharomyces cerevisiae, Teratocarcinoma cell line PA-1, sc 9117 (Buttner et 
aL, Mol. Cell. Biol. 11 (1991) 3573-3583), insect cells, CHO or COS cells as eukaryotic host 
cells and the transformed or transduced host cells are cultured under conditions which 
allow expression of the heterologous gene. The isolation of the protein can be carried out 
25 according to known methods from the host cell or from the culture supernatant of the host 
cell. Such methods are described for example by Ausubel L, FredericK M., Current 
Protocols in Mol. Biol. (1992), John Wiley and Sons, New York. Also in vitro reactivation 
of the protein may be necessary if it is not found in soluble form in the cell culture. 

PKW polypeptide can be purified after recombinant production by affinity 
30 chromatography using known protein purification techniques, including 
immunoprecipitation, gel filtration, ion exchange chromatography, chromatofocussing, 
isoelectric focussing, selective precipitation, electrophoresis, or the like. 
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membranes or carrier materials based on nitrocellulose (e.g., Schleicher and Schiill, BA 85, 
Amersham Hybond, C.)> strengthened or bound nitrocellulose in powder form or nylon 
membranes derivatized with various functional groups (e.g., nitro groups) (e.g., Schleicher 
and Schiill, Nytran; NEN, Gene Screen; Amersham Hybond M.; Pall Biodyne). 

5 Hybridizing DNA or RNA is then detected by incubating the carrier with an antibody or 
antibody fragment after thorough washing and saturation to prevent unspecific binding. 
The antibody or the antibody fragment is directed towards the substance incorporated 
during hybridization to the nucleic acid probe. The antibody is in turn labeled. However, it 
is also possible to use a directly labeled DNA. After incubation with the antibodies it is 

10 washed again in order to only detect specifically bound antibody conjugates. The 
determination is then carried out according to known methods by means of the label on 
the antibody or the antibody fragment 

The detection of the expression can be carried out for example as: 

in situ hybridization with fixed whole cells, with fixed tissue smears, 
15 - colony hybridization (cells) and plaque hybridization (phages and viruses), 
Southern hybridization (DNA detection), 
- Northern hybridization (RNA detection), 

serum analysis (e.g., cell type analysis of cells in the serum by slot-blot analysis), 
after amplification (e.g., PCR technique). 

20 

Therefore the invention also includes a method for the detection of carcinoma cells, 
comprising 

a) incubating a sample of a patient suffering from cancer, selected from the group of 
body flir*'. of cells, or of a cell extract or cell culture supernatants of said cells, 
25 whereby said sample contains nucleic acids with a nucleic acid probe which is 

selected from the group consisting of 

(i) the nucleic acid shown in SEQIDNOrl or a nucleic acid which is 
complementary to said sequence, and 

(ii) nucleic acids which hybridize with one of the nucleic acids from (i) and 

30 b) detecting hybridization, preferably by means of a further binding partner of the 
nucleic acid of the sample and/or the nucleic acid probe or by X-ray radiography. 
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Preferably, the nucleic acid probe is incubated with the nucleic acid of the sample and the 
hybridization is detected optionally by means of a further binding partner for the nucleic 
acid of the sample and/or the nucleic acid probe. As probes, nucleic acids of SEQ ID NO:8 
v2 , and SEQ ID NO:9 are preferred. 

i 

J$ 5 The nucleic acids according to the invention are hence valuable markers in the diagnosis 

| and characterization of tumors, especially of mammary tumors. 

The invention further comprises a method for producing a protein whose expression is 
correlated with tumors, by expressing an exogenous DNA in prokaryotic or eukaryotic host 
^ cells and isolation of the desired protein, wherein the protein is coded by the nucleic acid 

10 molecules according to the invention, preferably by the DNA sequence shown in 
SEQ ID NO:l. 

The protein can be isolated from the cells or the culture supernatant and purified by 
^ chromatographic means, preferably by ion exchange chromatography, affinity 

chromatography and/or reverse phase HPLC. 

15 The invention further comprises a protein according to the invention which is encoded by 
a nucleic acid molecule according to the invention, preferably having the nucleotide 
sequence set forth in SEQ ID NO:l. 

The present invention relates to the cloning and characterization of the gene PKW, which is 
especially characterized as a tumor progression gene, and as an upregulated gene indicative 
20 for the tumor progression potential of tumor cells, preferably of mammary tumor cells. 

According to the invention inhibitors for the expression of PKW (e.g., antisense 
^ nucleotides) can be used to inhibit tumor progression, preferably of mammar/ carcinomas, 

in vivo. 

-fe? The invention further provides methods for identifying and isolation of antagonists of 

25 PKW or inhibitors for the expression of PKW (e.g. antisense nucleotides). Such antagonists 
or inhibitors can be used to inhibit tumor progression and cause massive apoptosis of 
tumor cells in vivo. 

^ According to the invention there are provided methods for identifying and isolation of 

compounds which have utility in the treatment of cancer. These methods include methods 
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for modulating the expression of the polypeptides according to the invention, methods for 
identifying compounds which can selectively bind to the proteins according to the 
invention, and methods of identifying compounds which can modulate the activity of said 
polypeptides. The methods further include methods for modulating, preferably inhibiting, 
5 the transcription of PKW gene to mRNA. These methods can be conducted in vitro or in 
vivo and may make use of and establish cell lines and transgenic animal models of the 
invention. 

A PKW antagonist is defined as a substance or compound which decreases or inhibits the 
biological activity of PKW, a polypeptide and/or inhibits the transcription or translation of 
10 PKW gene. In general, screening procedures for PKW antagonists involve contacting 
candidate substances with host cells in which invasiveness is mediated by expression of 
PKW under conditions favorable for measuring PKW activity, 

PKW activity may be measured in several ways. Typically, the activation is apparent by a 
change in cell physiology, such as increased mobility and invasiveness in vitro, or by a 
15 change in the differentiation state, or by a change in cell metabolism leading to an increase 
of proliferation. 

As shown in Fig 1, gene PKW is expressed only in one of the primary carcinoma cell lines, 
the one derived from the medullary mammary carcinoma. Topology of small and large 
transcripts of gene PKW as well as the potential proteins encoded by them are outlined 

20 schematically ii: Fig 2. Small and large transcript of gene PKW share 723 bp at the 5'end 
and 512 bp at the 3'end. The large transcript contains an insertion of 1 107 bp (Fig 2A). The 
small transcript is due to differential splicing of the large transcript (bp 459-723 and 
1831-1850 of SEQ ID NO:l). The small transcript encodes a potential protein of 95 aa, the 
large transcript exhibits an open reading frame of 130 aa. Both potential proteins share an 

25 open reading fr^me of 88 aa with exception of one different aa at position 43 (nucleic acid 
position 586 of SEQ ID NO:l), followed by extensions of 7 aa and 42 aa for the smaller and 
the larger protein in different reading frames. The difference at position 43 may be a PGR 
artefact or could be caused by polymorphism. The 95 aa protein exhibits an isoelectric 
point (IEP) of 11.2, the IEP of the 130 aa protein is 10.4. These findings point to a nuclear 

30 localization of the proteins encoded by gene PKW. As shown in Fig 3 transcripts of gene 
PKW were deleted only in salivary gland, not in other adult human tissues and in a small 
panel of embryonic tissues such as fetal brain, heart, kidney, liver, spleen, thymus and lung. 
Promyelocytic leukemia cell line HL-60, HeLa cells, chronic myelogenous leukemia cell line 
K-562, lymphoblastic leukemia cell line MOLT-4, Burkitt lymphoma cell line Raji, 
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colorectal adenocarcinoma cell line SW 480, lung carcinoma cell line A549 and melanoma 
cell line G361 all scored negative with respect to mRNA for gene PKW (Fig 3). The probe 
used for hybridization detects the small as well as the large transcript of gene PKW. A panel 
of mammary carcinoma cell lines described in (Schwirzke, M., et al., Anticancer Res. 18 
5 (1998) 1409-1421) also tested negative with respect to the mRNA of gene PKW by 
Northern blotting as well as RT-PCR. These include MDA-435 (Cailleau, R., et al., In Vitro 
14 (1978) 911-915) derived subclones 4C4 and 2A5, cell lines MDA-MB231 (Cailleau, R., et 
al., J. Natl. Cancer Inst 53 (1974) 661-674), MDA-MB436 (Cailleau, R., et al., In Vitro 14 
(1978) 911-915), ZR-75(Engel, L.W., et al., Cancer Res. 38 (1978) 3352-3364), T47D 

10 (Freake, H.C., et al., Biochem. Biophys. Res. Commun. 101 (1981) 1131-1138), Hs578 T 
(Hackett, A.J., J. Natl. Cancer Inst. 58 (1977) 1795-1806), MCF-7 (Schiemann, S., et al., 
Anticancer Res. 17 (1997) 13-20; Schiemanny S., et al., Clin. Exp. Metastasis 16 (1998) 129- 
139), MCF-7adr (Schiemann, S., et al., Anticancer Res. 17 (1997) 13-20; Lee, J.H., et al., 
Biochem. Biophys. Res. Commun. 238 (1997) 462-467), LCC-1, LCC-2 and LCC-9 

15 (Brunner, N., et al., Cancer Res. 53 (1993) 283-290; Briinner, N., et al., Cancer Res. 53 
(1993) 3229-3232). In summary 11 mammary carcinomas were analyzed for expression of 
the small transcript of gene PKW by RT-PCR. 4 carcinomas scored positive. Parts of the 
results are displayed in Fig 4. Two of the positive carcinomas corresponded to ductal 
carcinomas and the other two matched with lobular carcinomas. 



20 The following examples, references, sequence listing and figures are provided to aid the 
understanding of the present invention. It is understood that modifications can be made in 
the procedures set forth without departing from the spirit of the invention. 



25 



30 



SEQIDNCM: 

SEQIDNO:2 

SEQIDNO:3 

SEQIDNCM 

SEQIDNO:5 

SEQIDNO:6 

SEQIDNO:7 

SEQIDNO:8 

SEQIDNO:9 

SEQIDNO:10 

SEQIDNO:ll 



cDNA of PKW gene and amino acid sequence of large PKW splice 
variant 

Amino acid of large PKW splice variant. 

cDNA and amino acid sequence of small PKW splice variant 

Amino acid of small PKW splice variant. 

Primer GSP1 

Primer GSP2 

Primer AUAP 

Primer RTR-5 

Primer RTF-6 

6-actin reverse primer 

fi-actin forward primer 



35 
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Figure 1 Northern blot revealing differentially expressed mRNA's in cell lines 

derived from normal human mammary gland and from breast 
carcinomas of distinct stages of progression. 

E: RNA was extracted from confluent cell lines, separated on a denaturing 
1% agarose-formaldehyde gel, transferred to a positively charged nylon 
membrane and hybridized to an [a- 32 P] -labeled probe corresponding to 
the appropriate subcloned fragment as revealed by the Differential 
Display Technique. 

Lane a: HMEC, normal Human Mammary Epithelial Cells; lane b: cell 
* line AR derived from, medullary mammary carcinoma; lanec: cell line 
WA derived from invasive ductal mammary carcinoma; lanes d, e and f: 
cell lines 1590, HG15 and KM22, derived from mammary carcinoma 
bone marrow micrometastases; lane g: metastatic mammary carcinoma 
cell line KS, derived from malignant ascites fluid. 

Figure 2 Schematic outline of transcripts of gene PKW as well as potential proteins 

encoded by these transcripts. Corresponding regions and domains are 
highlighted by conserved symbols. 

Figure 3 Multiple Tissue Array 

Normalized poly A+ RNA from different tissues and cell lines were 
hybridized to a 32P-labeled probe derived from gene PKW. E6 
corresponds to 1 fig and H6 to 0.1 (Xg poly A + RNA from cell line AR. The 
code is revealed below. No expression of PKW gene can be detected in 
these tissues and cell lines. 

Figure 4 Detection of transcripts of gene PKW in mammary carcinomas by 

RT-PCR. 

RNA was extracted from mammary carcinomas and analyzed for 
transcripts corresponding specifically to the small transcript of gene PKW 
as described in Example 9. 

Lane 0: DNA Molecular Weight Marker XIV (Boehringer Mannheim); 
I: cell line AR; II-IX: different mammary carcinomas samples; 
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lane a: p actin control (2.5 jxg RNA + specific primers for p actin); lane b: 
(2.5 |0g RNA + specific primers for gene PKW); lane c: negative controls 
without RT: 2.5 ng RNA + specific primers for gene PKW. 

^ An aliquot (15 |il) of the PCR products was analyzed on a 1.5% agarose 

5 gel. The bands specifically corresponding to mRNAs for gene PKW 

$ (137 bp) and fi-actin (587 bp) are depicted by arrows. 

I 

Example 1 

Cell lines and cell culture 

10 Human mammary epithelial ceDs (HMEC) were obtained from Bio Whittaker, Heidelberg, 
Germany. Primary mammary carcinoma cell lines AR and WA were obtained by 
fragmenting the primary tumor with scissors, treatment with collagenase (0.2 mg/ml in 5% 
FCS) and finally the cellular fraction was isolated by Ficoll gradient technique. Tumor cells 
were selected with a monoclonal antibody directed against MUC-1 coated to 
|| 15 Dynabeads®(Dynal, Norway). MUC-1 antibodies are described in WO 99/40881. 5 x 10 7 

*V beads were mixed with 10 7 cells, incubated for lh at 4°C on a roller device, the bead 

fraction was collected on a magnet, washed twice with DMEM and finally the cells were 
propagated in 75cm 2 culture flasks in DMEM supplemented with 10%FCS. Two clones 
were identified after 4 weeks for both cell lines referred to as AR and WA. Cell line AR is 

20 derived from an invasive medullary mammary carcinoma, cell line WA is derived from an 
invasive ductal carcinoma. Cell lines 1590, HG15 and KM22 are derived from bone marrow 
micrometastases of mammary carcinoma patients. Cellular fraction was isolated on a Ficoll 
gradient, erythrocytes were lysed and the cells were suspended in DMEM + 10%FCS and 
tumor cells were isolated on Dynabeads® coupled with MUC-1 antibody as described 

25 above. The bead fraction was cultivated in DMEM + 10%FCS, 10 fig/ml insulin and 
10 |xg/ml transferrin. Outgrowth of tumor cell lines was observed after 8 weeks. Clones 
( g were isolated by treatment with EDTA and propagated under standard conditions as 

described above. Cell line KS was isolated from malignant ascites fluid of a mammary 
carcinoma patient. 2000 ml of ascites were collected and the cellular fraction was isolated 
?* 30 on a Ficoll gradient. 2 x 10 7 cells were seeded into 750 cm 2 culture flasks. Tumor ceD 

clusters were obtained from the culture supernatants in order to separate them from 
adherently growing fibroblasts and mesothelial cells (passages 1-4). Passages 5-10 resulted 
in cultures growing partly as a monolayer and in suspension. Finally cells were propagated 
as a monolayer in DMEM + 10% FCS. 
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mRNA Differential Display PCR 

Differential display reverse transcriptase polymerase chain reaction (DD-RT-PCR) was 
5 performed following the method described by Liang and Pardee using the RNAimage™ kits 
(GenHunter Corp. Brookline, MA) according to the manufacturer's protocol. 

Total RNA was isolated from frozen cell pellets of all cell lines listed above making use of 
the RNeasy Midi® Kit (Qiagen, www.qiagen.de). Chromosomal DNA was removed from 
RNA samples by digestion at 37°C for 30 min with RNAse-free DNAse I using the 
10 MessageClean Kit® (GenHunter Corp. Brookline, MA). 

RNA was used as a template for first strand cDNA synthesis in the presence of 3 different 
one-base anchored oligo-dT primers (H-TnM, where M may be G; A or C). 



For a 20 pi reaction, 1 pi of DEPC-treated H 2 0, 4 pi of 5x reverse transcriptase buffer 
[125 mM Tris-Cl, pH 8.3, 188 mM KC1, 7.5 mM MgCl 2 , 25 mM dithiothreitol (DDT)], 

15 10 pi of dNTP mix [250 pM each], 2 jil of H-TnM primer [2 pM], and 2 pi of DNA-free 
total RNA sample [0.1 pg/pl] were mixed. The solution was heated to 65°C for 5 min and 
cooled to 37°C for 10 min, and 1 (xl [100 units] of Moloney murine leukemia virus 
(MMLV) reverse transcriptase was added. After incubation at 37°C for lh, the reaction was 
terminated by incubation at 75°C for 5 min. The following PCR procedure was performed 

20 in a 20 pi reaction, containing 2 pi of reverse transcription reaction mixture, 9.2 pi of 
DEPC-treated H 2 0> 2 pi pf lOx PCR buffer [100 mM Tris-Cl, pH 8.4, 500 mM KC1, 15 mM 
MgCl 2 , 0.01% gelatine], L6 pi of dNTP mix [25 pM each], 2 pi of the respective H-T n M 
primer [2 pM], 2 pi of an arbitrary 13-mer primer, [2 pM], 2 pi of OC-[ 35 S]dATP [>1000 
Ci/mmole] and 0.2 pi of AmpliTag [10 units/pl] DNA polymerase (Perkin Elmer, Norwalk, 

25 CT). PCR included a total of 40 cycles at 94°C for 30s, 40°C for 2 min, 72°C for 30 s, and 
finally 5 min at 72°C. After adding 2 pi loading buffer to 3.5 pi of each sample, the PCR 
products of all cell lines were heated at 80°C for 2 min and loaded in parallel on a 
denaturing 6% polyacrylamide sequencing gel for electrophoresis. The dried gel was 
exposed to BioMax™ MR film (Kodak) at room temperature for 24 to 48 h and the 

30 autoradiogram was analyzed for the differentially expressed genes. Bands corresponding to 
cDNA's of interest reproducibly displayed in two independent DD-RT-PCR reactions were 
excised from the dried gel, and the cDNA was eluted from the gel by soaking the gel slice in 
100 pi of dH 2 0 of 10 min and boiled for 15 min. After addition of 10 pi of 3M NaOAc and 
5 pi glycogen [lOmg/ml] as carrier the cDNA-fragments were recovered by precipitation 
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with 450 |uQ of ethanol and redissolved in 10 \jH dH 2 0. 4 fd eluted cDNA was reamplified in 
a second PCR using the same primer set and conditions except the dNTP concentrations of 
20 jiM each and no radioistope. As a control, gel slices were excised from lanes without 
visible bands on a level with the detected cDNA fragments of interest and treated as 
5 described above. The amplified PCR products obtained were analyzed on a 3% NuSieve® 
GTG (FMC BioProducts, Rockland), agarose gel, then purified using the QIAquick™ Gel 
Extraction kit (Qiagen, DE) and used as probes for Northern analysis. 

DNA sequencing of DD-RT-PCR fragments 

10 All PCR fragments of interest were sequenced directly after extraction and purification 
from agarose gels. The nucleotide sequences data were analyzed for homologies with 
known genes or ESTs in the current DNA data bases. 



4;' Northern blot analysis 



15 Poly A + -RNA was isolated from total RNA 1 \ig of polyA + -RNA from HMEC, AR, WA, 
1590, KM22, HG15 and KS cells were loaded side by side on a denaturing 1% agarose 
formaldehyde gel and then size-separated by electrophoresis. Blotting to positively charged 
nylon membrane was done by capillary downward transfer. After UV-crosslinking 
(Stratagene UV Stratalinker™ 2400, www.stratagene.com) blots were hybridized. For that 

20 the DD-RT-PCR products were labeled with a- [ 32 P] dATP up to a specific activity of 2x 10 9 
cpm/ng. Prehybridization (30 min) and hybridization (over-night) with radioactive probes 
were performed in ExpressHyb™ hybridization solution (Clontech, www.clontech.com) at 
68°C according to the manufacturer's recommendation. Membranes were washed in 
solution 1 (2;: SSC, 0.05% SDS) at room temperature for 30-40 min with continuous 

25 agitation and several replacements of the wash solution 1 followed by a washing step with 
solution 2 (0.1 x SSC, 0.1% SDS) at 50°C for 40 min with one change of fresh solution. The 
membranes were then exposed to Cronex™, Medical X-Ray Films (Sterling Diagnostic 
Imaging Inc., USA) at -80°C for 3 to 72 h. Equal loading and transfer of mRNA to the 
membrane was assessed by rehybridizing the blots with <x-( 32 P]dATP-labeled 

30 glyceraldehyde-3-phosphate dehydrogenase (GAPDH). 
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Cloning of DD-RT-PCR fragments 

Northern analysis was first performed using hybridization probes generated directly by 
PCR reamplification. Those amplified PCR fragments corresponding to differentially 
5 expressed mRNAs on a Northern blot were subcloned Subcloned fragments were isolated 
and stored for further experiments to verify differential expression. 

5' RACE PCR 

This method was applied to isolate the cDNA's of gene PKW. To identify the 5'-sequences 
10 of both transcripts a 5'RACE v^apid Amplification of cDNA Ends) PCR was performed 
following the manual as described in the 5' RACE System for Rapid Amplification of cDNA 
Ends Kit, Version 2.0 (Gibco BRL, Life Technologies). First strand cDNA was synthesized 
from total RNA (without digestion with DNase I) using the gene-specific primer GSP1 
(STTATCTTTATTCATTTTGG-S', SEQIDNO:5) and Superscript™ II, an RNAse H" 
15 derivative of the Moloney Murine Leukemia Virus Reverse Transcriptase (M-MVL RT). 
After cDNA synthesis, the solution was purified from unincorporated dNTPs and GSPL 
TdT (Terminal deoxynucleotidyl transferase) was used to add homopolymeric tails to the 
3 'ends of the cDNA. Tailed cDNA then was amplified by PCR using a nested, gene-specific 
primer GSP2(5TGCGGGACTCGTCGTAAGTATGC-3', SEQ ID NO:6),which anneals 3' 
20 to GSP1, and tLe deoxyinosine-containing abriged universal amplification primer AUAP 
(5'GGCCACGCGTCGACTAGTAC-3', SEQ ID NO:7). After several reactions with varying 
parameters also the longer cDNA (corresponding to the 2.6 kb transcript ) could be 
detected in addition to the shorter one, which appears in any reaction, ^he enriched and 
purified cDNA's were cloned and sequenced. 

25 pxaKipl$ 7 

Human Multiple Tissue Expression Array (MTE™) 

This array (Clontech, Palo Alto, CA) contains normalized loadings of poly A + -RNA from 
76 different human tissues as well as control RNAs and DNAs as revealed in Fig 5. The blot 
was hybridized with an <x-[ 32 P]dATP PKW cDNA according to the instructions of the 
30 manufacturer and exposed to X-ray film at -70°C 
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Isolation of RNA from Breast Tumor Tissues 

Total RNA was isolated from frozen tumor samples. The frozen tissues were covered with 
the suggested amount of lysis buffer and immediately disrupted and homogenized by 
5 means of a homogenizer for 45-60 sec arid 20000 U/min. The homogenized iysate was 
further processed as described in the manufacturer's protocol. 

Example 9 

Reverse Transcription Polymerase Chain Reaction (RT-PCR) 



In order to eliminate genomic DNA contamination the total RNA samples were treated 
10 with RNAse-free DNAsel at 37°C for 30 min. First strand synthesis was performed 
following the protocol of the first strand cDNA Synthesis Kit for RT-PCR (Roche 
Diagnostics GmbH, DE) by using the specific reverse primer RTR-5 
H ( 5'CCATTCATTCATTTTCAAG3 ', SEQ ID NO:8). The reverse transcription reaction was 

$ performed at 25°C for 10 min and then at 55°C for 60 min. After incubation, the AMV 

15 Reverse Transcriptase was denatured at 99°C for 5 min. For each sample a negative control 
reaction without AMV Reverse Transcriptase was performed 

The resulting single-stranded cDNA was amplified by PGR (High Fidelty PCR Master, 
Roche Diagnostics GmbH, DE) utilizing a second specific forward primer RTF-6 
(5'AAAACGCATGGCTTGTC3', SEQ ID NO:9). The amplification was performed with an 
20 initial denaturation step at 94°C for 2 min, 10 cycles of 15 s denaturation at 94°C, 30 s 
annealing at 57°C and 1 min elongation at 72°C, followed by cycles under same conditions. 
Equal loading and integrity of mRNA was assessed by a control RT-PCR with p-actin 
primers (reverse primer 5'AGGGTACATGGTGGTGCCGCCAGAC3' SEQIDNO:10 
v . forward primer 5'CCAAGGCCAACCGCGAGAAGATGAC3' SEQ ID NO:ll). 



•3 
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Patent Claims 2 6. Mai 2000 

A nucleic acid which is upregulated in mammary tumor cells, said nucleic acid being 
selected from the group consisting of: 

(a) SEQ ID NO: 1; 

(b) a nucleic acid sequence which hybridizes under stringent conditions with a 
nucleic acid probe of the complementary sequence of (a); 

(c) a nucleic acid sequence which, because of the degeneracy of the genetic code, is 
not a sequence of (a) or (b), but which codes for a polypeptide having exactly 
the same amino acid sequence as a polypeptide encoded by a sequence of (a) or 
(b);and 

(d) a nucleic acid sequence which is a fragment of any of the sequences of (a), (b) 
or (c). 

A nucleic acid according to claim 1, having a sequence which codes for a polypeptide 
of amino acids of SEQ ID NO:2 or SEQ ID NO:4. 

The nucleic acid according to claim 1, wherein the nucleic acid sequence of (b) 
hybridizes in 6.0 X SSC at about 45°C, followed by a wash with from about 2.0 X SSC 
to about 0.2 X SSC at a temperature from about 50°C to about 65°C. 

The nucleic acid according to claim 1, wherein the nucleic acid has a sequence of 
nucleotides 459 to 848 of SEQ ID NO:l or of nucleotides 1 to 285 of SEQ ID NO:3. 

An expression vector comprising a nucleic acid of claims 1 to 4. 

A host transformed by a nucleic acid of claims 1 to 4. 

A polypeptide which induces tumor progression, wherein said polypeptide is encoded 
by a nucleic acid selected from the group consisting of: 

(a) SEQ ID NO: 1; 

(b) a nucleic acid sequence which hybridizes under stringent conditions with a 
nucleic acid probe of the complementary sequence of (a) ; and 

(c) a nucleic acid which is a fragment of any of the sequences of (a) or (b). 
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8. The polypeptide according to claim 7, wherein the nucleic acid sequence of (b) 
hybridizes in 6.0 X SSC at about 45°C, followed by a wash with from about 2.0 X SSC 
to about 0.2 X SSC at a temperature from about 50°C to about 65°C. 

9. A process for detecting the presence or absence of at least one specific nucleic acid or 
5 mixture of nucleic acids, or distinguishing between two different nucleic acids in said 

sample, wherein the sample is suspected of containing said nucleic acid or acids, 
which process comprises the following steps in order: 

v (a) incubating said sample under stringent hybridization conditions with a nucleic 

acid probe which is selected from the group consisting oft 
10 (i) a nucleic acid with a sequence taken from the group consisting of SEQ ID 

NO:l or a fragment thereof; 

(ii) a nucleic acid with a sequence which is complementary to any nucleic 
acid of (i); 

j£ (iii) a nucleic acid with a sequence which hybridizes under stringent 

$ 15 conditions with the nucleic acidof(i); and 

(iv) a nucleic acid with a sequence which hybridizes under stringent 
conditions with the nucleic acid of (ii); and 
(b) determining whether said hybridization has occurred. 

20 10. The process in accordance with claim 9, wherein said hybridization conditions 
comprise 6.0 X SSC at about 45°C, followed by a wash with from about 2.0 X SSC to 
about 0.2 X SSC at a temperature from about 50°C to about 65°C. 

11. A process for determining whether or not a test sample originating from or 
containing human cells has a tumor progression potential, wherein the test sample 
25 and a second sample originating from non- tumor cells from the same individual or a 

different individual of the same species, which process comprises the following steps: 

:2 (a) incubating said sample under stringent hybridization conditions with a nucleic 

acid probe which is selected from the group consisting oft 
(i) a nucleic acid with a sequence of SEQ ID NO:l or a fragment thereof, 
30 (ii) a nucleic acid with a sequence which is complementary to any nucleic 

acid of (i); 

(iii) a nucleic acid with a sequence which hybridizes under stringent 
conditions with the nucleic acid of (i); and 
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(iv.) a nucleic acid with a sequence which hybridizes under stringent 
conditions with the nucleic acid of (ii); and 

(b) determining the approximate amount of hybridization of each respective 
sample with said probe and 

(c) comparing the approximate amount of hybridization of the test sample to an 
approximate amount of hybridization of said second sample to identify 
whether or not the test sample contains a lower amount of the nucleic acid than 
does said second sample. 
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A nucleic acid molecule (PKW) with the nucleic acid sequence SEQ ID NO:l is upregulated 
in mammary carcinoma cells. The PKW protein of SEQ ID NO:2 and SEQ ID NO:4 is also 
provided. A process for determining whether a sample contains tumor cells is provided. 
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tcc ggg agg gcg gcc ttt ccc ccc age ccc aga ggt ggg cct gec ctt 668 
Ser Gly Arg Ala Ala Phe Pro Pro Ser Pro Arg Gly Gly Pro Ala Leu 
55 60 65 70 

5 ttc cga gca tgg gac aca gcc cag gaa aac gca tgg ctt gtc etc cag 716 
Phe Arg Ala Trp Asp Thr Ala Gin Glu Asn Ala Trp Leu Val Leu Gin 
75 80 85 

aca cag gtg eta aca ggg ccg tea gac aag ggc cag gga etc agg ctt 764 
10 Thr Gin Val Leu Thr Gly Pro Ser Asp Lys Gly Gin Gly Leu Arg Leu 
90 95 100 

tta gga att tea get cca gag cca cca tgc agt ggg ace agg ggt ctg 812 
Leu Gly lie Ser Ala Pro Glu Pro Pro Cys Ser Gly Thr Arg Gly Leu 
15 105 110 115 

cgt gga cag gaa gca age tgt gta gac ggg ggt cca tgaagtagag 858 
Arg Gly Gin Glu Ala Ser Cys Val Asp Gly Gly Pro 
120 125 130 

20 

acagggtttt ggggaaggtt ggggcagggc aaggaggaaa agecacattt acagcaattt 918 
ctgaagtctt ttcatttttt ccccctgaat cacgtccata ataggatttg aatttaataa 978 
25 actgetgaag gttcctggcc ctgagtccca gtgtcctccc agcccccgcc cagctgtggg 1038 
tgtgcatggg gageggtacg agggagggta aaatgggccc ctgggacgcc gcgtgcagag 1098 
cagagatgaa tggcccgaaa ccctcgcgct gctctgcgcc cttcgtcatc cagteggggt 1158 

30 

ggttagggac tgteagagaa aaataattta gcggccatgg ctctaactga tgtgctgcat 1218 
tctggggtca aatgactttt acaaagtagt agtgctgcct ggtttctcta tegtgagage 1278 
35 tcagggctga taacatgaaa gaaaaaggca ctgcagccag aattcactga cattcttcac 1338 
atttcacatg agtgggacgc aggagggggg ctggggaggg tggagggatg tttcctgett 1398 
aacagattca acagaagagt ggcaggctca gctgggtgag caaggtatcc cagegaeggg 1458 

40 

ggacacgccc cagaccatgg gtggtggggc ttctcagagg aggtggcagg agacccgagc 1518 
ctgccaaggt tgcacctaag gtcaegggea gcattaggag ggctctctcc cagtctcccc 1578 
45 acccccccgt cccccctccc ccaggctgca ggggtgaagt ggcttccagg aeggtcactg 1638 
gcaagtttaa gctacagaga gtgtagaaac agggtgaaaa aggaagagag aggggagtaa 1698 
ataagaagga ggtgtaagaa aagaccaagc caggccccag cgcccttgtg aggaagtgee 1758 

50 

cagggactta tgtggaagcc gtccttgctg tctgccacct tgtttttact tacattgtgt 1818 
ttttatttga gggcgagttt ggacggcaag actgatggag attgtggtct aaatgectet 1878 
55 aacccactcc ttaaaatgac caceggatgt tccacaagta cttgaaaatg aatgaatggc 1938 
ttcccgagag gcagaaggca ggggtgtgcc ctaccccacg ccggccaaga gttcaacaag 1998 
cattggttga caagtgaata gtgagcactt gaacccagtc acaattcaag atgagggctc 2058 
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tgccatgacg catgtggtct gtgtcaccct gcagtctccc tgagcagtgt ctgaggttcg 2118 
agtgggaccc tacattcgtg aagagattta tcatctcccc aggaaaaata acagattctg 2178 

5 

tcctaggtgt tgtgatgtaa caatggtagc gatcacagcc ataacttaca attattgcat 2238 
acttacgacg agtcccgcac tgggctaagt gctttttaac tatgtgaaat gtttctttcc 2298 
10 ttgattgatg ccaaaatgaa taaagataat tttctgtatc tgct 2342 



<210> 2 
<211> 130 
*:> 15 <212> PRT 



<213> Homo sapiens 
<400> 2 

Met Asp Pro Asp Val Val Leu Trp Ser Cys Thr Trp Lys Pro Ala Leu 
20 1 5 10 15 

Arg Gly Val Ser Leu Gly Leu Trp Ala Glu Asn Leu Lys His Arg Ala 
20 25 30 

25 Gly Thr Gin Val Gin Arg Leu His Arg Pro Ser Arg Arg Arg Cys Phe 
35 40 45 



30 



# 45 



Gin Ala Pro Trp Thr Asp Ser Gly Arg Ala Ala Phe Pro Pro Ser Pro 

50 55 60 

Arg Gly Gly Pro Ala Leu Phe Arg Ala Trp Asp Thr Ala Gin Glu Asn 

65 70 75 80 



Ala Trp Leu Val Leu Gin Thr Gin Val Leu Thr Gly Pro Ser Asp Lys 
35 85 90 95 

Gly Gin Gly Leu Arg Leu Leu Gly lie Ser Ala Pro Glu Pro Pro Cys 
100 105 110 

40 Ser Gly Thr Arg Gly Leu Arg Gly Gin Glu Ala Ser Cys Val Asp Gly 
115 120 125 



Gly Pro 
130 





<210> 


3 




<211> 


285 


50 


<212> 


DNA 




<213> 


Homo 




<220> 






<221> 


CDS 


55 


<222> 


(1) . 




<400> 


3 
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atg gat cca gac gtg gtt ttg tgg tec tgc acg tgg aag cca gec ctg 48 

Met Asp Pro Asp Val Val Leu Trp Ser Cys Thr Trp Lys Pro Ala Leu 
15 10 15 

*i 5 cgt ggg gtg age ctg gga ctg tgg gca gag aac etc aag cac egg gec 96 

Arg Gly Val Ser Leu Gly Leu Trp Ala Glu Asn Leu Lys His Arg Ala 
$ 20 25 30 

ggc acc caa gtg cag aga ctg cat cgt ccc aac agg agg cgc tgc ttc 144 
10 Gly Thr Gin Val Gin Arg Leu His Arg Pro Asn Arg Arg Arg Cys Phe 
|j 35 40 45 

cag get ccc tgg acg gac tec ggg agg gcg gec ttt ccc ccc age ccc 192 
Gin Ala Pro Trp Thr Asp Ser Gly Arg Ala Ala Phe Pro Pro Ser Pro 
^ 15 50 55 60 

aga ggt ggg cct gec ctt ttc cga gcg tgg gac aca gec cag gaa aac 240 
;j Arg Gly Gly Pro Ala Leu Phe Arg Ala Trp Asp Thr Ala Gin Glu Asn 

65 70 75 80 



20 



25 



gca tgg ctt gtc etc cag aca cag ggc gag ttt gga egg caa gac 285 
Ala Trp Leu Val Leu Gin Thr Gin Gly Glu Phe Gly Arg Gin Asp 
85 90 95 



30 



<210> 4 
<211> 95 
<212> PRT 
<213> Homo sapiens 

<400> 4 

Met Asp Pro Asp Val Val Leu Trp Ser Cys Thr Trp Lys Pro Ala Leu 
15 10 15 

35 Arg Gly Val Ser Leu Gly Leu Trp Ala Glu Asn Leu Lys His Arg Ala 
20 25 30 



40 



Gly Thr Gin Val Gin Arg Leu His Arg Pro Asn Arg Arg Arg Cys Phe 
35 40 45 

Gin Ala Pro Trp Thr Asp Ser Gly Arg Ala Ala Phe Pro Pro Ser Pro 
50 55 60 



Arg Gly Gly Pro Ala Leu Phe Arg Ala Trp Asp Thr Ala Gin Glu Asn 
45 65 70 75 80 



Ala Trp Leu Val Leu Gin Thr Gin Gly Glu Phe Gly Arg Gin Asp 
85 90 95 



50 



<210> 5 
<211> 19 
<212> DNA 
55 <213> Artificial Sequence 

<220> 

^ <223> Description of Artificial Sequence : primer GSP1 
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<400> 5 

ttatctttat tcattttgg 19 



<210> 6 



\ <211> 23 

<212> DNA 

<213> Artificial Sequence 
10 <220> 

<223> Description of Artificial Sequence : primer GSP2 
; <400> 6 

I tgcgggactc gtcgtaagta tgc 23 

«f is 

I 

<210> 7 
<211> 20 
<212> DNA 
20 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : primer AUAP 

M 25 <400> 7 

ggccacgcgt cgactagtac 20 

*8 

<210> 8 
30 <211> 19 
<212> DNA 

<213> Artificial Sequence 

i <220> 

35 <223> Description of Artificial Sequence: primer RTR-5 

<400> 8 

ccattcattc attttcaag 19 

40 

<210> 9 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer RTF- 6 
<400> 9 

50 aaaacgcatg gcttgtc 17 



45 



<210> 10 
<211> 25 
55 <212> DNA 

<213> Artificial Sequence 



Printed:07-02-2001 



5 
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Small transcript of gene PKW 
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512 bp 



5-| 
5-1 



3-3* (1235 bp) 



I -3' (2342 bp) 



723 bp 1107 bp 

Large transcript of gene PKW 



512 bp 



B 



Protein encoded by small transcript of gene PKW 



88 aa 



7 aa 



(95 aa) 



■mi 



(130 aa) 



88 aa 42 aa 

Protein encoded by large transcript of gene PKW 
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